ExpECT: an expanded error categorisation method for text input
نویسندگان
چکیده
This paper describes an empirical study on typing errors made by children during a text copy exercise. The literature on text input errors is first examined, focussing on studies of errors that occur during keyboard typing. A study of errors made by children during typing is described and the results from this study are analysed using visual inspection and already published error categorisation methods. These methods are compared with respect to the types and number of errors categorised and uncategorised. We identify and define new kinds of typing errors and use these, together with previously defined error types, to outline an expanded and more detailed method (ExpECT) for the classification of typing errors. ExpECT is compared with the previously examined categorisation methods and is shown to be a more thorough and broader method for the analysis of typing errors.
منابع مشابه
Text input error categorisation: solving character level insertion ambiguities using Zero Time analysis
A review of literature on text input error categorisation revealed the need for a formal method to assist in solving ambiguities. This paper proposes a method of solving one such set of ambiguities, those caused by insertion of an extra letter. The method uses two rules: the Zero Time rule and Impossible NT/CT-Mu rule to establish whether the extra letter was inserted with another letter, or in...
متن کاملImproving Biomedical Text Categorisation with NLP
Background: Text categorisation has been used in bioinformatics to help identify documents containing protein-protein interactions. Standard text categorisation methods have used the bag-of-words approach with little input from NLP. While this has proved effective in the past, there is some evidence that the techniques are not adequate in some biological domains. Here we examine how chunking, n...
متن کاملAutomatic Style Categorisation of Corpora in the Greek Language
In this article, a system is proposed for the automatic style categorisation of text corpora in the Greek language. This categorisation is based to a large extent on the type of language used in the text, for example whether the language used is representative of formal Greek or not. To arrive to this categorisation, the highly inflectional nature of the Greek language is exploited. For each te...
متن کاملCorrecting ‘Wrong-Column’ Errors in Text Databases
We present a novel data-driven approach for detecting and correcting errors in text databases. We focus on information that was accidentally entered in an incorrect column. Unlike machine-learning approaches to data cleaning that assume the database cells to contain atomic or numeric content, our method takes into account substrings of textual cells, and treats error detection and correction as...
متن کاملA Graph Based Methodology for the Representation and Evaluation of Text Input Strategies for Miniature and Mobile Devices
In this paper a new methodology for representing text-input strategies for miniature and mobile devices is presented. The methodology is based on representing text-input strategies as graphs. Graph representations allow different static mobile text-input strategies to be represented in a uniform manner. Further, different strategies are easily compared as the graph representation allows various...
متن کامل